DO 





CO 

RAW SEQUENCE LISTING DATE: 11/29/2001 7^ 

PATENT APPLICATION: US/09/847 , 809A TIME: 14:05:27 § 

Input Set : N:\Crf3\RULE60\09847809A.txt 

^ Output Set: N:\CRF3\11292001\I847809A.raw 

SEQUENCE LISTING 

C--> I (1) GENERAL INFORMATION: 

^ I (i) APPLICANT: Sandman , Olga 

3 Hillman/ Jennifer L. 

9 Lai, Preeti 

,0 Cor ley, Neil C. 

jl Shah; Purvi 

Is (11) TITLE OF INVENTION: HUMAN RETICULOCALBIN ISOFORMS 

15 (iii) NUMBER OF SEQUENCES: 6 

[7 (iv) CORRESPONDENCE ADDRESS: 

18 (A) ADDRESSEE: Incyte Pharmaceuticals, Inc. 

19 (B) STREET: 3174 Porter Drive 

20 (C) CITY: Palo Alto 

21 (D) STATE: CA 
,22 (E) COUNTRY: USA 
'23 (F) ZIP: 94304 

25 (V) COMPUTER READABLE FORM: 

26 (A) MEDIUM TYPE: Diskette 

27 (B) COMPUTER: IBM Compatible 

28 (C) OPERATING SYSTEM: DOS 

29 (D) SOFTWARE: FastSEQ for Windows Version 2.0 
31 (vi) CURRENT APPLICATION DATA: 

C--> 32 (A) APPLICATION NUMBER: US/09/847 , 809A 

C--> 33 (B) FILING DATE: Ol-May-2001 

34 (C) CLASSIFICATION: 

36 (vii) PRIOR APPLICATION DATA: 

37 (A) APPLICATION NUMBER: 09/270,270 

38 (B) FILING DATE: 1999-03-16 

40 (viii) ATTORNEY/AGENT INFORMATION: 

41 (A) NAME: Billings, Lucy J. 

42 (B) REGISTRATION NUMBER: 36,749 

43 (C) REFERENCE/DOCKET NUMBER: PF-0358 US 
4 5 (ix) TELECOMMUNICATION INFORMATION: 

46 (A) TELEPHONE: 650-855-0555 

47 (B) TELEFAX: 650-845-4166 

48 (C) TELEX: 
51 (2) INFORMATION FOR SEQ ID NO : 1: 

53 (i) SEQUENCE CHARACTERISTICS: 

54 (A) LENGTH: 328 amino acids 

55 (B) TYPE: amino acid 

56 (C) STRANDEDNESS: single 

57 (D) TOPOLOGY: linear 
59 (vii) IMMEDIATE SOURCE: 
6 0 (A) LIBRARY: RATRNOT02 
61 (B) CLONE: 922578 

6 3 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1: 

6 5 Met Met Trp Arg Pro Ser Val Leu Leu Leu Leu Leu Leu Leu Arg His 
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66 1 5 10 15 

67 Gly Ala Gin Gly Lys Pro Ser Pro Asp Ala Gly Pro His Gly Gin Gly 

68 20 25 30 

69 Arg Val His Gin Ala Ala Pro Leu Ser Asp Ala Pro His Asp Asp Ala 

70 35 40 45 

71 His Gly Asn Phe Gin Tyr Asp His Glu Ala Phe Leu Gly Arg Glu Val 

72 50 55 60 

73 Ala Lys Glu Phe Asp Gin Leu Thr Pro Glu Glu Ser Gin Ala Arg Leu 

74 65 70 75 80 

75 Gly Arg lie Val Asp Arg Met Asp Arg Ala Gly Asp Gly Asp Gly Trp 

76 85 90 95 

77 Val Ser Leu Ala Glu Leu Arg Ala Trp lie Ala His Thr Gin Gin Arg 

78 100 105 110 

79 His lie Arg Asp Ser Val Ser Ala Ala Trp Asp Thr Tyr Asp Thr Asp 

80 115 120 125 

81 Arg Asp Gly Arg Val Gly Trp Glu Glu Leu Arg Asn Ala Thr Tyr Gly 

82 130 135 140 

83 His Tyr Ala Pro Gly Glu Glu Phe His Asp Val Glu Asp Ala Glu Thr 

84 145 150 155 160 

85 Tyr Lys Lys Met Leu Ala Arg Asp Glu Arg Arg Phe Arg Val Ala Asp 

86 165 170 175 

87 Gin Asp Gly Asp Ser Met Ala Thr Arg Glu Glu Leu Thr Ala Phe Leu 

88 180 185 190 

89 His Pro Glu Glu Phe Pro His Met Arg Asp lie Val lie Ala Glu Thr 

90 195 200 205 

91 Leu Glu Asp Leu Asp Arg Asn Lys Asp Gly Tyr Val Gin Val Glu Glu 

92 210 215 220 

93 Tyr lie Ala Asp Leu Tyr Ser Ala Glu Pro Gly Glu Glu Glu Pro Ala 

94 225 230 235 240 

95 Trp Val Gin Thr Glu Arg Gin Gin Phe Arg Asp Phe Arg Asp Leu Asn 

96 245 250 255 

97 Lys Asp Gly His Leu Asp Gly Ser Glu Val Gly His Trp Val Leu Pro 

98 260 265 270 

99 Pro Ala Gin Asp Gin Pro Leu Val Glu Ala Asn His Leu Leu His Glu 

100 275 280 285 

101 Ser Asp Thr Asp Lys Asp Gly Arg Leu Ser Lys Ala Glu lie Leu Gly 

102 290 295 300 

103 Asn Trp Asn Met Phe Val Gly Ser Gin Ala Thr Asn Tyr Gly Glu Asp 

104 305 310 315 320 

105 Leu Thr Arg His His Asp Glu Leu 

106 325 

108 (2) INFORMATION FOR SEQ ID NO : 2: 

110 (i) SEQUENCE CHARACTERISTICS: 

111 (A) LENGTH: 1463 base pairs 

112 (B) TYPE: nucleic acid 

113 (C) STRANDEDNESS: single 

114 (D) TOPOLOGY: linear 

116 (vii) IMMEDIATE SOURCE: 

117 (A) LIBRARY: RATRNOT02 
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118 (B) CLONE: 922578 

120 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2: 

122 CGCAGAGCGG ACGTGGAGAG CGGACGNCAG CTGGATAACA 

123 CCATCAGTTC TGCTGCTTCT GTTGCTACTG AGGCACGGGG 

124 GACGCAGGCC CTCATGGCCA GGGGAGGGTG CACCAGGCGG 
12 5 CATGATGACG CCCACGGGAA CTTCCAGTAC GACCATGAGG 

126 GCCAAGGAAT TCGACCAACT CACCCCAGAG GAAAGCCAGG 

127 GACCGCATGG ACCGCGCGGG GGACGGCGAC GGCTGGGTGT 

128 TGGATCGCGC ACACGCAGCA GCGGCACATA CGGGACTCGG 

129 TACGACACGG ACCGCGACGG GCGTGTGGGT TGGGAGGAGC 

130 CACTACGCGC CCGGTGAAGA ATTTCATGAC GTGGAGGATG 

131 CTGGCTCGGG ACGAGCGGCG TTTCCGGGTG GCCGACCAGG 

132 CGAGAGGAGC TGACAGCCTT CCTGCACCCC GAGGAGTTCC 

133 ATTGCTGAAA CCCTGGAGGA CCTGGACAGA AACAAAGATG 

134 TACATCGCGG ATCTGTACTC AGCCGAGCCT GGGGAGGAGG 

135 GAGAGGCAGC AGTTCCGGGA CTTCCGGGAT CTGAACAAGG 

136 GAGGTGGGCC ACTGGGTGCT GCCCCCTGCC CAGGACCAGC 

137 CTGCTGCACG AGAGCGACAC GGACAAGGAT GGGCGGCTGA 

138 AATTGGAACA TGTTTGTGGG CAGTCAGGCC ACCAACTATG 

139 CACGATGAGC TGTGAGCACC GCGCACCTGC CACAGCCTCA 

140 AGGAGGGGCC GCTGTGGTCT GGCCCCCTCC CTGTCCAGGC 

141 TCCCAGGCAT CCTCCTGCCC CTGGGCTCTC AGGGACCCCC 

142 TCACACCCCC AACCCCAGGG AGGGGCTGTC ATAGTCCCAG 
14 3 CTGACTGAGT CTCCCAGCCC AGACCCAGGG ACCCTTGGCC 
144 CGCCCCAACC CCTCCAGCTC CAAATCTGAG CCTCCACCAC 
14 5 CCCCAGCCCT CTCCTGCCTG GCCTGGCCTG GGACACCTCC 
14 6 AAGCCAGCGC CGGGAAAAAA AAA 

14 8 (2) INFORMATION FOR SEQ ID NO : 3: 

150 (i) SEQUENCE CHARACTERISTICS: 

151 (A) LENGTH: 315 amino acids 

152 (B) TYPE: amino acid 

153 (C) STRANDEDNESS: single 

154 (D) TOPOLOGY: linear 

156 (vii) IMMEDIATE SOURCE: 

157 (A) LIBRARY: BLADNOT03 

158 (B) CLONE: 1601793 

160 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3: 

162 Met Asp Leu Arg Gin Phe Leu Met Cys Leu Ser 

163 15 10 

164 Ala Leu Ser Lys Pro Thr Glu Lys Lys Asp Arg 

165 20 25 

166 Gin Leu Ser Asp Lys Val His Asn Asp Ala Gin 

167 35 40 

168 His Asp Ala Phe Leu Gly Ala Glu Glu Ala Lys 

169 50 55 

170 Thr Pro Glu Glu Ser Lys Glu Arg Leu Gly Lys 

171 65 70 75 

172 Asp Gly Asp Lys Asp Gly Phe Val Thr Val Asp 

173 85 90 



GGGGACCGAT 
CCCAGGGGAA 
CCCCCCTGAG 
CTTTCCTGGG 
CCCGTCTGGG 
CGCTGGCCGA 
TGAGCGCGGC 
TGCGCAACGC 
CAGAGACCTA 
ATGGGGACTC 
CTCACATGCG 
GCTATGTCCA 
AGCCGGCGTG 
ATGGGCACCT 
CCCTGGTGGA 
GCAAAGCGGA 
GCGAGGACCT 
GAGGCCCGCA 
CCCGCAGGAG 
TGGGTCGGCT 
AGGATAAGCA 
CCAAGCTCAG 
ATAGACTGAA 
TCTCTGCCAG 



GATGTGGCGA 
GCCATCCCCA 
CGACGCTCCC 
ACGGGAAGTG 
GCGGATCGTG 
GCTTCGCGCG 
CTGGGACACG 
CACCTATGGC 
CAAAAAGATG 
GATGGCCACT 
GGACATCGTG 
GGTGGAGGAG 
GGTGCAGACG 
GGATGGGAGT 
AGCCAACCAC 
AATCCTGGGT 
GACCCGGCAC 
CAATGACCGG 
GCAGATGCAG 
TCTGTCCCTG 
ATACCTATTT 
CTCTAAGAAC 
ACTCCCCTGG 
GAGGCAATAA 



Leu Cys Thr Ala Phe 
15 

Val His His Glu Pro 
30 

Ser Phe Asp Tyr Asp 
45 

Thr Phe Asp Gin Leu 
60 

He Val Ser Lys He 
80 

Glu Leu Lys Asp Trp 
95 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1463 
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INFORMATION FOR SEQ ID NO : 4: 
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SEQUENCE 


: CHARACTERISTICS 
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(A) 


LENGTH 


: 2658 base pairs 






















(B) 


TYPE: nucleic acid 




















^ V ^ 






(C) 


STRANDEDNESS: single 
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(D) 


TOPOLOGY: linear 




















^ X ^ 


(vii) 


IMMEDIATE SOURCE: 






















91 
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(A) 


LIBRARY : BLADNOTO 3 




















9 1 A 

^ X *x 






(B) 


CLONE : 


1601793 






















z J. b 


(xi) 


SEQUENCE 


! DESCRIPTION: SEQ ID NO: 


; 4: 














218 


CCCGCTTCCG GTTGGGCGGT GCTTGCGCGC 


GTGAGCTGAG 


CCGGTGGGTG AGCGGCGGCC 


60 


219 


ACGGCATCCT GTGCTGTGGG GGCTACGAGG 


AAAGATCTAA 


TTATCATGGA CCTGCGACAG 


120 


220 


TTTCTTATGT GCCTGTCCCT GTGCACAGCC 


TTTGCCTTGA 


GCAAACCCAC AGAAAAGAAG 


180 


221 


GACCGTGTAC ATCATGAGCC TCAGCTCAGT 


GACAAGGTTC 


ACAATGATGC TCAGAGTTTT 


240 


222 


GATTATGACC ATGATGCCTT CTTGGGTGCT 


GAAGAAGCAA 


AGACCTTTGA TCAGCTGACA 


300 


223 


CCAGAAGAGA GCAAGGAAAG GCTTGGAAAG 


ATTGTAAGTA 


AAATAGATGG CGACAAGGAC 


360 


224 


GGGTTTGTCA CTGTGGATGA GCTCAAAGAC 


TGGATTAAAT 


TTGCACAAAA GCGCTGGATT 


420 


225 


TACGAGGATG TAGAGCGACA GTGGAAGGGG 


CATGACCTCA 


ATGAGGACGG CCTCGTTTCC 


480 


226 


TGGGAGGAGT ATAAAAATGC CACCTACGGC 


TACGTTTTAG 


ATGATCCAGA TCCTGATGAT 


540 


227 


GGATTTAACT ATAAACAGAT GATGGTTAGA 


GATGAGCGGA 


GGTTTAAAAT GGCAGACAAG 


600 


228 


GATGGAGACC TCATTGCCAC CAAGGAGGAG 


TTCACAGCTT 


TCCTGCACCC TGAGGAGTAT 


660 
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229 GACTACATGA AAGATATAGT AGTACAGGAA ACAATGGAAG ATATAGATAA GAATGCTGAT 

230 GGTTTCATTG ATCTAGAAGA GTATATTGGT GACATGTACA GCCATGATGG GAATACTGAT 

231 GAGCCAGAAT GGGTAAAGAC AGAGCGAGAG CAGTTTGTTG AGTTTCGGGA TAAGAACCGT 

232 GATGGGAAGA TGGACAAGGA AGAGACCAAA GACTGGATCC TTCCCTCAGA CTATGATCAT 
2 33 GCAGAGGCAG AAGCCAGGCA CCTGGTCTAT GAATCAGACC AAAACAAGGA TGGCAAGCTT 
2 34 ACCAAGGAGG AGATCGTTGA CAAGTATGAC TTATTTGTTG GCAGCCAGGC CACAGATTTT 
2 35 GGGGAGGCCT TAGTACGGCA TGATGAGTTC TGAGCTACGG AGGAACCCTC ATTTCCTCAA 

236 AAGTAATTTA TTTTTACAGC TTCTGGTTTC ACATGAAATT GTTTGCGCTA CTGAGACTGT 

237 TACTACAAAC TTTTTAAGAC ATGAAAAGGC GTAATGAAAA CCATCCCGTC CCCATTCCTC 

238 CTCCTCTCTG AGGGACTGGA GGGAAGCCGT GCTTCTGAGG AACAACTCTA ATTAGTACAC 

239 TTGTGTTTGT AGATTTACAC TTTGTATTAT GTATTAACAT GGCGTGTTTA TTTTTGTATT 
24 0 TTTCTCTGGT TGGGAGTATG ATATGAAGGA TCAAGATCCT CAACTCACAC ATGTAGACAA 

241 ACATTAGCTC TTTACTCTTT CTCAACCCCT TTTATGATTT TAATAATTCT CACTTAACTA 

242 ATTTTGTAAG CCTGAGATCA ATAAGAAATG TTCAGGAGAG AGGAAAGAAA AAAAATATAT 
24 3 GCTCCACAAT TTATATTTAG AGAGAGAACA CTTAGTCTTG CCTGTCAAAA AGTCCAACAT 
244 TTCATAGGTA GTAGGGGCCA CATATTACAT TCAGTTGCTA TAGGTCCAGC AACTGAACCT 
24 5 GCCATTACCT GGGCAAGGAA AGATCCCTTT GCTCTAGGAA AGCTTGGCCC AAATTGATTT 
246 TCTTCTTTTT CCCCCTGTAG GACTGACTGT TGGCTAATTT TGTCAAGCAC AGCTGTGGTG 
24 7 GGAAGAGTTA GGGCCAGTGT CTTGAAAATC AATCAAGTAG TGAATGTGAT CTCTTTGCAG 
24 8 AGCTATAGAT AGAAACAGCT GGAAAACTAA AGGAAAAATA CAAATGTTTT CGGGGCATAC 
24 9 ATTTTTTTTC TGGGTGTGCA TCTGTTGAAA TGCTCAAGAC TTAATTATTT GCCTTTTGAA 
250 ATCACTGTAA ATGCCCCCAT CCGGTTCCTC TTCTTCCCAG GTGTGCCAAG GAATTAATCT 
2 51 TGGTTTCACT ACAATTAAAA TTCACTCCTT TCCAATCATG TCATTGAAAG TGCCTTTAAC 
2 52 GAAAGAAATG GTCACTGAAT GGGAATTCTC TTAAGAAACC CTGAGATTAA AAAAAGACTA 
2 53 TTTGGATAAC TTATAGGAAA GCCTAGAACC TCCCAGTAGA GTGGGGATTT TTTTCTTCTT 
2 54 CCCTTTCTCT TTTGGACAAT AGTTAAATTA GCAGTATTAG TTATGAGTTT GGTTGCAGTG 
255 TTCTTATCTT GTGGGCTGAT TTCCAAAAAC CACATGCTGC TGAATTTACC AGGGATCCTC 
2 56 ATACCTCACA ATGCAAACCA CTTACTACCA GGCCTTTTTC TGTGTCCACT GGAGAGCTTG 
257 AGCTCACACT CAAAGATCAG AGGACCTACA GAGAGGGCTC TTTGGTTTGA GGACCATGGC 
2 58 TTACCTTTCC TGCCTTTGAC CCATCACACC CCATTTCCTC CTCTTTCCCT CTCCCCGCTG 
2 59 CCAAAAAAAA AAAAAAAGGA AACGTTTATC ATGAATCAAC AGGGTTTCAG TCCTTATCAA 

260 AGAGAGATGT GGAAAGAGCT AAAGAAACCA CCCTTTGTTC CCAACTCCAC TTTACCCATA 

261 TTTTATGCAA CACAAACACT GTCCTTTTGG GTCCCTTTCT TACAGATGGG ACCTCTTGAG 

262 GAAGGAATTA TCGTATTC 

264 (2) INFORMATION FOR SEQ ID NO : 5: 

266 (i) SEQUENCE CHARACTERISTICS: 

267 (A) LENGTH: 331 amino acids 

268 (B) TYPE: amino acid 

269 (C) STRANDEDNESS : single 

270 (D) TOPOLOGY: linear 

272 (vii) IMMEDIATE SOURCE: 

273 (A) LIBRARY: GenBank 

274 (B) CLONE: 1262329 

276 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

278 Met Ala Arg Gly Gly Arg Gly Arg Arg Leu Gly Leu Ala Leu Gly Leu 

279 1 5 10 15 

280 Leu Leu Ala Leu Val Leu Ala Pro Arg Val Leu Arg Ala Lys Pro Thr 

281 20 25 30 

282 Val Arg Lys Glu Arg Val Val Arg Pro Asp Ser Glu Leu Gly Glu Arg 



720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
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2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2658 
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